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Abstract 


We report the complete genome sequences of a buffalo coronavirus (BufCoV HKU26) detected from the faecal samples of two domestic 
water buffaloes (8 ubalus bubalis ) in Bangladesh. They possessed 98-99% nucleotide identities to bovine coronavirus (BCoV) genomes, 
supporting BufCoV HKU26 as a member of Betacoronavirus I. Nevertheless, BufCoV HKU26 possessed distinct accessory proteins 
between spike and envelope compared to BCoV. Sugar-binding residues in the N-terminal domain of S protein in BCoV are conserved in 
BufCoV HKU26. 
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Coronaviruses are classified into four genera, with bat coro- 
naviruses known as the gene source of Alphacoronavirus and 
Betacoronavirus, and avian coronaviruses as the gene source of 
Gammacoronavirus and Deltacoronavirus [1,2]. However, lineage 
A Betacoronavirus is unique among the genus in originating in 
rodents instead of bats [3]. Lineage A Betacoronavirus comprises 
several coronavirus species, including murine coronavirus, hu¬ 
man coronavirus HKUI, Chinese Rattus coronavirus HKU24, 
rabbit coronavirus HKUI4 and Betacoronavirus I [ 3 - 5 ]. Beta¬ 
coronavirus I is best known for its tendency for recombination 
and interspecies transmission among various mammalian spe¬ 
cies [ 3 , 6 , 7 ]. In particular, human coronavirus OC43 (HCoV 
OC43) likely originated from a relatively recent zoonotic 


transmission event, with the most recent common ancestor of 
HCoV OC43 and bovine coronavirus (BCoV) dating to around 
1890 [8]. Besides cattle, BCoV-like viruses have been detected 
in various ungulates, including water buffalo calves with 
gastroenteritis in Italy [ 9 , 10 ]. However, only partial gene se¬ 
quences, the longest one being ~9.6 kb spanning ORFIb to 
nucleocapsid (N), were obtained from the buffalo viruses 
[ 9 , 10 ]. 

We report the complete genome sequences of a buffalo 
coronavirus (BufCoV HKU26) detected from the faecal samples 
of two domestic adult water buffaloes (8 ubalus bubalis ) in 
Bangladesh. RT-PCR for coronavirus detection, and complete 
genome sequencing and analysis were performed as described 
previously [I 1,12]. The genomes of BufCoV HKU26 strains BI - 
24F and BI-28F were 31 021 and 30 975 in length, with G+C 
content of 40%. They possessed 98-99% nucleotide identities 
to the genomes of BCoVs, supporting the classification of 
BufCoV HKU26 as a member of the species Betacoronavirus I 
(Fig. I). The genome organization is also characteristic of line¬ 
age A Betacoronavirus, with the putative transcription regulatory 
sequence motif 5'-C(U/C)AAAC-3' (Fig. I). The two BufCoV 
HKU26 genomes encode five putative accessory proteins 
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FIG. I Phylogenetic tree constructed from complete genomes of BufCoV and other members of Betacoronavirus lineage A (top). Tree was con¬ 
structed by maximum likelihood method using general-time-reversible model including proportion of invariable sites with gamma-distributed sub¬ 
stitution rates and bootstrap values calculated from 100 trees. Betacoronavirus I indicated at right. Boldface type indicates 2 strains of BufCoV with 
complete genomes sequenced in this study. SACoV, sable antelope coronavirus (EF42462I); GiCoV, giraffe coronavirus (EF424623); ACoV, alpaca 
coronavirus (DQ9I5I64); WtDCoV, white-tailed deer coronavirus (FJ425I87); SDCoV, sambar deer coronavirus (FJ425I89); WbCoV, waterbuck 
coronavirus (FJ425I86); BCoV, bovine coronavirus (DQ8I 1784); BufCoV, buffalo coronavirus; DcCoV, dromedary camel coronavirus (KF906249); 
CRCoV, canine respiratory coronavirus (JX860640); HCoV OC43, human coronavirus OC43 (AY39I777); PHEV, porcine haemagglutinating 
encephalomyelitis virus (DQ0I 1855); ECoV, equine coronavirus (EF4466I5); RbCoV HKUI4, rabbit coronavirus HKUI4 (JN874559); ChRCoV, 
China Rattus coronavirus HKU24 (KM349742); HCoV HKUI, human coronavirus HKUI (AY5970I I); MHV, murine hepatitis virus (FJ647223); RCoV, 
rat coronavirus (FJ938068). Genome organization of BufCoV (bottom). Position of transcriptional regulatory sequences of each gene is indicated. 
ORFs between spike (S) and envelope (E) gene are magnified to show differences between two BufCoVs. ORFlab are represented by green boxes. 
Haemagglutinin-esterase (HE), S, E, membrane (M) and nucleocapsid (N) are represented by red boxes. Putative accessory proteins are represented by 
blue boxes. 
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conserved among most members of Betacoronavirus I, including 
a 32 kDa protein between ORFlab and haemagglutinin- 
esterase, three proteins between spike (S) and envelope (E) 
and one internal protein overlapping with N. In BCoV, the 
three proteins between S and E were of typical size: 4.9, 4.8 and 
12.8 kDa respectively. In BufCoV, the 4.9 kDa protein is 
replaced by a 2.9 kDa protein (25 aa) due to a premature stop 
codon. The 4.8 kDa protein of BufCoV HKU26 was also 
different from that of BCoVs, with a smaller protein of 29 aa in 
strain BI-24F and a protein of 44 aa in strain BI-28F due to a 
frameshift mutation caused by a single nucleotide deletion after 
the original start codon. Compared to other BCoV-like viruses, 
BufCoV HKU26 possessed a unique serine—>asparagine sub¬ 
stitution at position 354 of N protein. In contrast to murine 
coronavirus, which utilizes carcinoembryonic antigen-related 
cell adhesion molecule I as a receptor, BCoV and HCoV OC43 
bind to N-acetyl-9-O acetyl neuraminic acid for cell entry 
[13,14]. All the known critical and noncritical sugar-binding 
residues in the N-terminal domain of S protein in BCoV are 
conserved in BufCoV HKU26 [15]. The genome sequences of 
BufCoV HKU26 have been lodged at GenBank under accession 
numbers KU558922 and KU558923. 
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